# Multimodal Chatbot
Llava V1.5 7b M3
Apache-2.0
M3 is a multimodal model that allows explicit control of visual granularity at runtime and can serve as a metric for image/dataset complexity. It is fine-tuned from LLaMA/Vicuna.
Text-to-Image
Transformers

L
mucai
33
2
Sharegpt4v 7B
ShareGPT4V-7B is an open-source multimodal chatbot model trained using GPT4-Vision-assisted data and LLaVA instruction fine-tuning data.
Text-to-Image
Transformers

S
Lin-Chen
530
82
Featured Recommended AI Models